Subjective evaluations for perception of speaker identity through acoustic feature transplantations
Authors
Abstract
Perception of speaker identity is an important characteristic of the human auditory system. This paper describes a subjective test that investigates the relevance of four acoustic features in this process: vocal tract, pitch, duration, and energy. PSOLA-based methods provide the framework for transplanting these acoustic features between two speakers. The test database consists of different combinations of transplantation outputs obtained from a database of 8 speakers. Subjective decisions on speaker similarity indicate that the vocal tract is the most relevant feature for single-feature transplantations. Pitch and duration are of similar significance, whereas energy is the least important acoustic feature. Vocal tract + pitch + duration transplantation results in the highest similarity to the target speaker. Vocal tract + pitch, vocal tract + duration + energy, and vocal tract + duration transplantations also yield convincing results in transforming the perceived speaker identity.
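As a rough illustration of the condition space, the sketch below enumerates every non-empty combination of the four features named in the abstract. The abstract does not say which combinations were actually included in the test, so the full enumeration and the helper name `transplantation_conditions` are assumptions made for illustration only.

```python
from itertools import combinations

# Feature names are taken from the abstract; enumerating all subsets is an assumption.
FEATURES = ("vocal tract", "pitch", "duration", "energy")


def transplantation_conditions(features=FEATURES):
    """Return every non-empty subset of the features, smallest subsets first."""
    return [
        combo
        for size in range(1, len(features) + 1)
        for combo in combinations(features, size)
    ]


conditions = transplantation_conditions()
for combo in conditions:
    # Print each condition in the "a + b + c" notation used in the abstract.
    print(" + ".join(combo))
```

With four features this gives 15 candidate conditions: 4 single-feature, 6 two-feature, 4 three-feature, and 1 four-feature transplantation.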
Similar resources
What makes a good speaker? Subject ratings, acoustic measurements and perceptual evaluations
This paper deals with subjective qualities and acoustic-prosodic features contributing to the impression of a good speaker. Subjects rated a variety of samples of political speech on a number of subjective qualities, and acoustic features were extracted from the speech samples. A perceptual evaluation was also conducted with manipulations of F0 dynamics, fluency and speech rate with the sample of...
When speaker identity is unavoidable: Neural processing of speaker identity cues in natural speech.
Speech sound acoustic properties vary largely across speakers and accents. When perceiving speech, adult listeners normally disregard non-linguistic variation caused by speaker or accent differences, in order to comprehend the linguistic message, e.g. to correctly identify a speech sound or a word. Here we tested whether the process of normalizing speaker and accent differences, facilitating th...
What makes a good speaker? Subjective ratings and acoustic measurements
The paper deals with qualities contributing to the impression of a “good speaker” – a speaker capable of catching the attention of an audience through her/his way of speaking. Subjective ratings of speaker qualities were correlated with acoustic analyses of samples of speech produced in Swedish parliament debates. Raters reliably differentiated between more and less skilled speakers and reached...
Similar Speaker Selection Technique Based on Distance Metric Learning with Perceptual Voice Quality Similarity
This paper describes a similar speaker selection technique based on distance metric learning. Our aim is selection of a perceptually similar speaker using acoustic features from a multispeaker database. A novel point of the proposed technique is training a transform matrix using the perceptual voice quality similarity between many speakers obtained from a subjective evaluation to convert acoust...
Publication date: 2003